Add attention kernels optimized for arm's i8mm instruction. #942

Merged
copybara-service[bot] merged 1 commit into dev from test_938467018 on Jul 3, 2026

Conversation

@copybara-service

Add attention kernels optimized for Arm's i8mm instructions.
They deliver about 8x higher throughput than the previous i8 implementation.
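
For context, the i8mm extension adds int8 matrix multiply-accumulate instructions such as SMMLA, which multiplies a 2x8 int8 block by an 8x2 int8 block and accumulates the result into a 2x2 int32 tile. The following is a minimal scalar sketch of that one step, not the kernels from this change: the helper name `MmlaS8Reference` and the array-based layout are assumptions made for illustration, and the right operand is assumed to be stored transposed (as two rows of eight values).

```cpp
#include <array>
#include <cstdint>

// Hypothetical scalar reference for a single SMMLA-style step.
// a:   2x8 int8 block, row-major.
// b:   8x2 int8 block stored transposed, i.e. as 2 rows of 8 values.
// acc: 2x2 int32 tile, row-major, updated in place: acc += a * b^T.
inline void MmlaS8Reference(const std::array<int8_t, 16>& a,
                            const std::array<int8_t, 16>& b,
                            std::array<int32_t, 4>& acc) {
  for (int i = 0; i < 2; ++i) {
    for (int j = 0; j < 2; ++j) {
      int32_t sum = 0;
      for (int k = 0; k < 8; ++k) {
        // Widen to int32 before multiplying so the products cannot overflow.
        sum += static_cast<int32_t>(a[8 * i + k]) *
               static_cast<int32_t>(b[8 * j + k]);
      }
      acc[2 * i + j] += sum;
    }
  }
}
```

Each such step performs 32 multiply-accumulates. In an attention kernel, a tile of this shape could plausibly cover two query rows against two key rows over eight int8 channels, but how the merged kernels tile the computation is not stated in this PR.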

copybara-service[bot] force-pushed the test_938467018 branch 9 times, most recently from 8d2f809 to 27b2995, on July 3, 2026 15:04

PiperOrigin-RevId: 942147808
copybara-service[bot] merged commit 02548ac into dev on Jul 3, 2026
copybara-service[bot] deleted the test_938467018 branch on July 3, 2026 15:13